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The fundamental gap is a central qnantity iir the electronic structure of matter. Unfortunately, the 
fundamental gap is not generally equal to the Kohn-Sham gap of density functional theory (DFT), 
even in principle. The two gaps differ precisely by the derivative discontinuity, namely, an abrupt 
change in slope of the exchange-correlation (xc) energy as a function of electron number, expected 
across an integer-electron point. Popular approximate functionals are thought to be devoid of a 
derivative discontinuity, strongly compromising their performance for prediction of spectroscopic 
properties. Here we show that, in fact, all exchange-correlation functionals possess a derivative 
discontinuity, which arises naturally from the application of ensemble considerations within DFT, 
without any empiricism. This derivative discontinuity can be expressed in closed form using only 
quantities obtained in the course of a standard DFT calculation of the neutral system. For small, 
hnite systems, addition of this derivative discontinuity indeed results in a greatly improved prediction 
for the fundamental gap, even when based on the most simple approximate exchange-correlation 
density functional - the local density approximation (LDA). For solids, the same scheme is exact 
in principle, but when applied to LDA it results in a vanishing derivative discontinuity correction. 
This failure is shown to be directly related to the failure of LDA in predicting fundamental gaps 
from total energy differences in extended systems. 


I. INTRODUCTION 

Density-functional theory (DFT) [H-Q is the leading 
theoretical framework for studying the electronic proper¬ 
ties of matter. It is based on mapping the interacting- 
electron system into the Kohn-Sham (KS) system of non¬ 
interacting electrons, which are subject to a common ef¬ 
fective potential. DFT is a first principles approach, i.e. 
the only necessary input for the theory is the external 
potential, Vext(/), and the total number of electrons, TV; 
no experimental data are required. In principle, the KS 
mapping is exact. In practice, it involves an exchange- 
correlation (xc) density functional, £'a;c[n(r)], whose ex¬ 
act form is unknown and is always approximated. 

Present-day approximations within DFT already make 
it widely applicable to a variety of many-electron sys¬ 
tems in physics, chemistry, and materials science [l0l - ll4| . 
Specifically, quantities that can be derived from the total 
energy of the system, notably structural and vibrational 
properties, can often be obtained with a satisfactory ac¬ 
curacy of a few percent or better. However, extraction 
of quantities such as the ionization potential (IP) or the 
fundamental gap directly from the KS eigenvalues often 
results in serious discrepancies with experiment (see, e.g.. 
Refs. [il-iH). 

It is by now well-known (23 - 1^ that for the exact 
(but generally unknown) xc functional the highest oc¬ 
cupied [ho) KS eigenvalue would equal the negative of 
the ionization potential of the interacting-electron sys¬ 
tem. This result is known as the ionization potential 
theorem. However, the KS gap, , namely the energy 
difference between the lowest unoccupied {lu) and ho KS 
eigenvalues, would still not equal the fundamental gap of 
the interacting system, Eg. This difference is due to a 
finite, spatially uniform “jump” in the KS potential, ex¬ 


perienced as the electron number, N, crosses an integer 
value. This “jump” is called the derivative discontinu¬ 
ity, A, for reasons discussed in detail below. Extensive 
numerical investigations have shown [H, [13, E m, 113 
that the value of A associated with the exact KS poten¬ 
tial for various systems of interest is not at all negligible 
in comparison to E^^. It is commonly understood that 
standard semi-local xc potentials, like the local density 
approximation (LDA) |28 M3(| or generalized-gradient ap¬ 
proximations (GGAs) [31I4^ . do not possess a derivative 
discontinuity by construction. As a result, such approxi¬ 
mations effectively “average over” it in the vicinity of the 
integer point [13, [13| ■ Gonsequently, the ionization po¬ 
tential theorem is grossly disobeyed and in addition the 
fundamental gap is greatly underestimated. 

For finite systems, a correct positioning of the ho and 
the lu KS eigenvalues is highly advantageous when de¬ 
scribing processes like ionizatiom photoemission, charge 
transport or transfer, etc. [39l - l4^ . But at least the ion¬ 
ization potential and electron affinity, and ergo the fun¬ 
damental gap, which equals the difference between the 
two, can be calculated based on total energy differences 
between neutral, cation, and anion (see, e.g., (45l - [5lj |^ 

For periodic systems, e.g., crystalline solids, this is no 
longer the case. Because such systems are represented 
by a unit cell with periodic boundary conditions, vary¬ 
ing the number of electrons per unit cell means adding 
or subtracting charge from each replica of the unit cell 
and therefore an infinitely large charge from the system 
as a whole. The ensuing divergence is usually avoided 
by introducing a compensating uniform background to 
the unit cell, keeping the overall system neutral. How¬ 
ever, this hinders the straightforward use of total energy 
differences for deducing fundamental gaps, as discussed, 
e.g., in Refs. [H, [s^j- Thus, there is a clear advantage 
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in extracting the fundamental gap based on KS eigenval¬ 
ues and other quantities arising in the calculation of the 
neutral system itself, without alteration of the number of 
electrons. 

Many novel approaches have been developed within 
DFT for improving the accuracy of fundamental gap pre¬ 
diction beyond that afforded by conventional semi-local 
functionals, with an emphasis on applications to crys¬ 
talline solids. These can be broadly divided into several 
categories. 

Within the KS framework, significant attention has 
been devoted to the employment of functionals that do 
possess an inherent derivative discontinuity. Much of the 
effort involved the exact-exchange (EXX) functional, us¬ 
ing the optimized effective potential (OEP) [11, 
approach [ssl - f^ . More recently, novel semi-local func¬ 
tionals were constructed so as to mimic EXX-OEP prop¬ 
erties [614^ . 

Alternatively, it is possible to step outside the KS 
framework and use the generalized KS (GKS) scheme [H, 
m, m, [tII , where mapping to a partially interacting elec¬ 
tron system results in a KS-like equation that includes a 
non-multiplicative potential operator. This operator can 
reduce the magnitude of the derivative discontinuity, po¬ 
tentially driving it down to negligible values. Practical 
GKS schemes often rely on the Pock operator, or variants 
thereof. Such GKS calculations were performed, e.g. , 
by employing the screened exchange approach 1691 ITII- 
[7a| . by using global hybrid functionals [7ftl84 ISdi - ISq . 
range-separated hybrid functionals [13, ISlL l87l - l96j| . and 
by applying a scaling correction to the Hartree and ex¬ 
change functionals |9^ . Alternatively, one can step 
outside the KS scheme by introducing orbital-specific cor¬ 
rections, where different electrons of the KS system are 
subject to different potentials. This is achie ved, e.g ., 
using self-interaction correction methods m, ioEfioit. 
DFT -I-U and Koopmans’ c omp liant functionals |l02l - 
llll| . the LDA -1/2 method [ll^ . Fritsche’s generalized 
approach [ll3l Ill4| , or a scissors-like oper ator to the KS 
system that affects only the vacant states ini. 

A different possibility altogether is to sidestep the full 
charging problem by considering total energy differences 
arising from appropriately constructed partial charging 
schemes, e.g., by employing dielectric screening proper¬ 
ties [s^ , by averaging the KS tran sition energies around 
the direct band-gap transition |ll6l| , or b y employing per¬ 
turbative curvature considerations ina. 

But must conventional semi-local functionals really be 
abandoned as far as band gap prediction is concerned? 
Recently, this question has received some a ttent ion. For 
finite systems, Andrade a nd A spuru-Guzik |ll8l | and Gi- 
dopoulos and Lathiotakis [ll9j| have suggested derivative- 
discontinuity correction schemes based on an e lectro- 
static corr ectio n of the asymptotic potential |l2Cll |. Chai 
and Ghen |l2l| derived a perturbative approach for the 
evaluation of the missing derivative discontinuity. In 
the first order, this perturbative treatment leads to the 
’’frozen orbital approximation” result, discussed lately by 


Baerends and co-workers [l22l| . We have shown that, 
contrary to conventional wisdom, in fact the KS poten¬ 
tial derived from any xc functional possesses a deriva¬ 
tive discontinuity [l23|, whose value emerges naturally 
and non- empirical ly from the ensemble generalization of 
DFT [^ . Il24l4l28j| . These approaches have, so far, been 
applied to atoms and molecules and their potential for 
the solid-state remains unexplored. 

Here, we derive an explicit, closed-form expression for 
the derivative discontinuity, A, of an arbitrary many- 
electron system studied with an arbitrary xc functional. 
This derivation is based on the ensemble generalizat ion o f 
the Hartree and xc energy terms suggested in Ref. [l23j| . 
Furthermore, A is expressed using only quantities associ¬ 
ated with the neutral system, thereby avoiding alteration 
of the electron number. Therefore, the formalism is, in 
principle, applicable to both finite and periodic systems. 
Focusing on the latter, we explore analytically the scaling 
of A with system size. We find that while for the exact 
xc functional A must be independent of system size, for 
standard xc approximations like the EDA the derivative 
discontinuity vanishes. This failure is shown to directly 
related to the failure of EDA in predicting fundamental 
gaps from total energy differences in extended systems. 
These findings are demonstrated by illustrative calcula¬ 
tions. 


II. THE ENSEMBLE APPROACH 

The central quantity we discuss below is the funda¬ 
mental gap, Eg. It is defined as 

Eg=I-A, (1) 

i.e., it is the difference between the ionization energy, I, 
and the electron affinity, A. As I and A involve removal 
and addition of an electron, respectively, in the following 
we analyse in detail the properties of a many-electron 
system with a varying number of electrons. 

At zero temperature, the ground state of a many- 
electron system with a (possibly) fractional number of 
electrons N = Nq + a {Nq £ N and a G [0,1]) is de¬ 
scribed by an ensemble state [2^ 

A = (1 - a)|^'Aro)(5'Aro| + a|^'Aro+i)(TAr(, + i|, (2) 

where IdtArQ+p) is a pure many-el ectro n grou nd state with 
Nq+p electrons and p is 0 or 1 [l55j| . |156| . As a result, 
the ground-state energy E{N) at a fractional A is a linear 
combination of the ground-state energies at the closest 
integer points: 

E{N) = {l-a)E{No) + aE{No + l). (3) 

Therefore, the function E{N) is piecewise-linear (see 
Fig. (Ha) for an illustration). 

The energy obtained in DFT using the KS system with 
the exact xc functional has to repr oduc e the piecewise- 
linear behavior. Janak’s theorem |l29l| states that the 
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FIG. 1: A schematic plot of the total energy E (top) and the 
highest occupied KS energy level Sho (bottom) depending on 
the number of particles N for the exact xc-functional 


*-th KS eigenenergy, Si, equals dE/dfi ~ the derivative 
of the total energy of the interacting system, E, with 
respect to the occupation of the i-th level, fi. Applying 
this theorem, we hnd that with the exact xc functional 
the ho eigenenergy, Sho, is a stair-step function of the 
number of electrons, N (see Fig. [TJb)). 

A non-vanishing fundamental gap in a many-electron 
system indicates a discontinuity in the chemical poten¬ 
tial, namely, the cost of electron insertion and removal 
is different. This physical discontinuity is manifested as 
a mathematical discontinuity exhibited in Fig. [T] From 
the perspective of total energies, / = E{No — 1) — E{No) 
and A = E{No) — E{No + 1) are the slopes of E{N) to 
the left and right of Nq , respectively, which are generally 
different from each other. The gap Eg is then the dif¬ 
ference in these two slopes. From the perspective of KS 
eigenvalues, I = —eho{No) and A = —eho{No + l) (owing 
to the IP-theorem), and then Eg is the magnitude of the 
step in eho{N) at Nq [2^ - 1^ . In other words. Eg is the 
discontinuity of the derivative of the E{N) curve, or the 
discontinuity of the eho{N) curve itself, at Nq. 

Clearly, in order to obtain the fundamental gap from 
total energy differences one has to calculate not only the 
system of interest (with N = iVo), but also its anion 
(^N = Nq + 1) and its cation [N = Nq — 1). Figure [T](b) 
suggests, however, an alternative route: Eg can, at least 
in principle, be derived by analyzing the left and the 


right limits, N —>■ Nq and N —>■ Nq , of the neutral 
many-electron system. 

Consider now the celebrated KS equation Q, given in 
Hartree units by 

-b i;ifs(r)^ ^pi{r) = Siipiif), (4) 

where vks {^ is the KS potential and ^Pi(f) are the KS 
orbitals. What elements of the KS equation may cause 
the “jump” of eho{N) at integer TV, shown in Fig. [iKb)? 
One obvious source is simply the fact that the state 
named ho for TV —>■ TV^ is not the same state as the 
one named ho for TV —>■ Nq , due to the infinitesimal oc¬ 
cupation of the next available energy level, whose eigen¬ 
value is different. However, there is a more subtle second 
source: There is nothing to prevent the KS potential 
itself from exhibitin g an ab rupt “jump” across the in¬ 
teger point [3 Eli 113: Il30l| . Because an infinitesimal 
change in TV across the integerpoint can only change 
the density n(r) infinitesimally [37j, the potential vks {^ 
cannot “jump” by more than a spatial constant. Oth¬ 
erwise, in the limit of Nq the same density would be 
achieved from two potentials differing by more than a 
constant, in direct contradiction of the Hohenberg-Kohn 
theorem [ij. In the complementary energy picture, the 
hrst source mentioned above - change in leading orbital ~ 
results in an abrupt slope change of the KS kinetic en¬ 
ergy, Tks = Y.i The second source - 

the “jump” in the KS potential - creates an abrupt slope 
change in the Hartree-exchange-correlation energy, Ehxc- 

We denote the aforementioned spatial constant by A 
and write: 

+ A. (5) 

Here and below we use the superscripts ^ and ^ to denote 
quantities immediately to the left or to the right of the 
integer point Nq. 

Equation ([S]) has two immediate consequences. Upon 
infinitesimal crossing of the integer point, TVq, from TV|j” 
to Nq , all the KS eigenvalues “jump” by the same quan¬ 
tity, i.e., ef- = ef + A. However, the KS orbitals do not 
exhibit any change: ipfir) = (ff{r). As a special case 
of these statements, and = £:f„ 4- A. 

These simple statements are key to the following deriva¬ 
tion. For the gap we then obtain 

Eg = eho{NQ + 1)- eho{NQ) = - £^„ -b A. 

( 6 ) 

Using the definition of the KS gap, E^^ = efi, — we 
arrive at [I,!!!,[13,[Iig 

Eg = Ef + N. ( 7 ) 

The above expression is an exact result and therefore 
must be obeyed by results obtained from the exact KS 
potential. For any given approximate xc potential, how¬ 
ever, the degree to which Eq. is obeyed may vary, 
depending on the deviation of eho{N) from flatness, or 
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equivalen tly, on the de viation of E(N) from piecewise- 
linearity |ll7l Il3ll - ll39j| . 

As already mentioned, the density is continuous across 
the integer point, i.e., n^{f) = n^{r). Because in con¬ 
ventional (semi-)local approximate xc functionals such 
as the LDA and GGAs, the xc potential is a continuous 
function of the density (and its gradient), it is commonly 
believed that there is no mathematical possibility for the 
KS potential to “jump” and therefore A = 0. It is this 
last statement that we challenge in this work. 

If the interacting-electron system has a fractional elec¬ 
tron number, its corresponding KS system must also 
have a fractional electron number. Therefore, not only 
the ground state of the interacting system, but also the 
ground state of the KS system must unavoidably be de¬ 
scribed in terms of an ensemble, while still being fully 
described by a single KS potential. In analogy to Eq. 
the KS ensemble state must be written in the form 

A^^ = (1 - a)|<^)(<^| + (8) 

where and are pure KS ground states, 

with Nq and Aq -|- 1 electrons, respectively. These pure 
states are Slater determinants formed from the Aq or 
Nq + 1 occupied KS orbitals, obtained from the same 
KS potential, i.e., the two Slater determinants differ only 
in that the Aq -|- 1 one contains one more orbital. The 
KS potential generating them is, generally, neither that 
of the pure Nq system nor that of the pure Aq -|- 1 sys¬ 
tem. Therefore, in contrast to the quantities |4tjVo-i-p) 
used to describe the ensemble state of the interacting 
system, all quantities of the KS ensemble in Eq. may 
generally change with the electron fraction, i.e., are a- 
dependent. We emphasize this by using the superscript 
(a). Ensemble-averaging the many-electron Goulomb op¬ 
erator IE = i '^j^i Ia ~ ™ the KS system, the 

Hartree-exchange-correlation energy fu nctio nal general¬ 
izes to ensembles in the following form |l23j |: 


The density n{r) of the ensemble state is expressed in 
terms of p^\r) as 

n{r) = {I - a)p^^\r) + ap^^\r). (10) 

To obtain Ef.-Hxc[n]^ we construct p^^\r) and p^^\r) 
from the KS orbitals as mentioned above, substitute 
them into the functional Ehxc to obtain E^xciPo^'^] and 
EhxcIPi^'^], take the linear combination of the lat¬ 
ter according to Eq. ®. Note that this procedure is not 
equivalent to constructing the ensemble density n{'r) from 
a linear combination of p^^\r) and p^^\r) (cf. Eq. (fTOll l 
and substituting it into Ehxc, as the latter functional is 
not linear with respect to the density. 

The generalization in Eq. is applicable to any xc 
functional and makes the Hartree and the xc energy com¬ 
ponents explicitly dependent on the KS orbitals and on 
a. However, there may still remain an implicit non-linear 
dependence of Ei,-Hxc[n] on a because the KS orbitals, 
may themselves change with a. Finally, note 
that for pure states, i.e., for a = 0 or 1, the ensem¬ 
ble generalized Ee-Hxc[n] reduces to the pure-state form 
EHxc[n], as expected. 

Because the KS potential, and specifically its behav¬ 
ior around an integer electron number, is central to this 
work, we address it here in detail. Due to the ensem¬ 
ble generalization of the Hartree-exchange-correlation 
energy functional, the KS potential is expressed as 
VKs{r^ = Vextir) + Ve-Hxc[n]{r), where Vext{r) is the 
external potential and Ve-Hxc[n]{r) := SEe-Hxc/^n 
is the ensemble-generalized Hartree-exchange-correlation 
potential. While deriving the latter from Eq. ® we 
emphasize an unusual property of Ei,-Hxc[n]'. it explic¬ 
itly depends on a. Therefore, the ensemble-generalized 
Hartree-exchange-correlation potential reads 



Ee-Hxc[n\ = (1 - a)EHxc[p‘^^^] + aEnxcipi^^^]- (9) 

Here, the index e— signifies that the functional is 
ensemble-generalized, Ehxc[p] is the pure-state Hartree- 
exchange-correlation energy functional, and (r) is de¬ 
fined as p^\r) = namely the sum of 

the squares of the first Nq + p KS orbitals. We stress 
that p^\r) are auxiliary quantities that are not associ¬ 
ated with any physical density, except when N is an inte¬ 
ger. We further emphasize that the ensemble-generalized 
form of Eq. ([5]) is not an ansatz, but rather an inevitable 
consequence of employing the ensemble approach to de¬ 
scribe a KS system of fractional number of particles. If 
the exact pure-state xc functional Exc[n] were to be in¬ 
serted into Eq. Q, the ensemble-generalized total energy 
would have been exactly piecewise linear. Even then the 
auxiliary densities and need not equal the pure- 
state densities of Nq - and Nq + 1-systems). 


Since a[n] = N — floor(A) and N = J ncfir, we 
find 6a/Sn = 1. Therefore, Ve-HxcinKr) is a sum of 
two terms: vo[n] = {dEe-Hxc/da)^ and vi[n]{r) = 
{SEe-Hxc/6n{r))^. Because for fractional N the func¬ 
tional Ee-Hxc is orbital-dependent, via the quantities 
p^\r}, irrespective of the underlying xc functional, the 
potential uijri.](£) has to be treated with the OEP ap¬ 
proach IlsL [m i. The somewhat unusual term vq is 
spatially uniform but a-dependent, and it arises from 
the aforementioned explicit dependence of Ee-Hxc[n] on 
a. 

We focus now on vo, which can be written as 

^ ( dEe-Hxc \ _ / dEe-Hxc \ 

° \ da J „ \ da 

\ / an(r) \ 

/aV da 

( 12 ) 


- / dA 


5E, 
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6n(r) 
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This result is obtained by taking the partial derivative 
{dE^_Hxc/^OL){lPi}^ followed by isolation of Vq. Using 
Eqs. dH) and OT to evaluate the first and the second 
terms on the right-hand side of Eq. (HI, respectively, we 
obtain 

Vo = Ehxc[p^i'‘] - Ehxc[p^^^] - j (fir\Lpho{f)\^vi[n]{r). 

(13) 

for N G {No, No + 1]. For N G {Nq — l,iVo], one has to 
substitute with and p[°‘^ with p^^\ We stress 
that Vo is a well-defined, rather than an arbitrary, poten¬ 
tial shift. It must be taken into account for the ensemble- 
generalized funct iona l, if Sho is to equal dE/dN, i.e., if 
Janak’s theorem [l29j| is to be obeyed. The existence of a 
spatially uniform poten tial shift vo is in agreement with 
earlier studies ll4Cll| . which found that whereas for 
fractional N the KS potential is well-defined, for integer 
N it is defined up to a constant. The latter ambiguity 
in the definition of the KS potential can be removed by 
reaching the integer number of electrons No from below 
(for a discussion, see [H, [2^). 

Note that ug and vi {r) are obtained via different quan¬ 
tities when N G (fVg — l,A^o] and N G {No,No + 1]. 
Therefore, when approaching No from the left and from 
the right, we generally expect to obtain different KS po¬ 
tentials. In other words, we expect vks{^ to change 
discontinuously when crossing an integer number of elec¬ 
trons. As mentioned above, this discontinuity must be a 
spatially uniform constant, A (cf. Eq. dS])). 

The consequences of the generalization presented 
above are schematically depicted in Fig. [21 based o n nu - 
merical results for finite systems presented in Ref. |l23j . 
The ensemble generalization brings £ho{N) closer to the 
desired stair-step form: it becomes more flat for frac¬ 
tional N and the “jump” experienced at int eger N is 
increased by A. As observed by Stein et al. [lI7j| . the 
derivative discontinuity and piecewise-linearity of the to¬ 
tal energy are two sides of the same coin: A missing 
derivative discontinuity must be accompanied by devia¬ 
tion from piecewise linearity, and vice versa. Therefore, 
improvement in the description of £ho{N) inevitably re¬ 
flects on the total energy curve: the spurious convexity 
of E{N) is significantly reduced, bringing it closer to the 
desired piecewise-linear behavior, and the abrupt change 
of slope near the integer points is better repro duced . Im¬ 
portantly, the numerical results given in Ref. |l23j | show 
that for ions of atoms and small molecules ensemble- 
generalization of the local spin-density approximation 
(LSDA) does indeed yield fundamental gaps in much 
better agreement with experiment than standard LSDA 
calculations. For example, for HJ, a gap of 5.80 eV pre¬ 
dicted with LSDA is increased to 17.96 eV with ensemble- 
LSDA, reducing the discrepancy with respect to experi¬ 
ment from 70% to 8%. For C+, LSDA predicts a gap of 
0.26 eV, which is increased to 15.31 eV with ensemble- 
LSDA, reducing the discrepancy with experiment from 
98% to 17%. 

Nonetheless, for an approximate xc functional the 
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FIG. 2: A schematic plot of the total energy E (top) and 
the highest occupied KS energy level £ho (bottom) as a func¬ 
tion of the number of electrons, N, obtained with the exact 
functional (solid line), with an approximate functional (e.g. 
the LDA; squares) and with the ensemble generalization of 
the approximate functional (rhombi). The KS gap, the 
ensemble-corrected gap, E^^ + A (Eq. (0), and the exact 
gap, I — A (Eq. 0), are denoted on the figure. 
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ensemble-corrected gap Eg = E^^ -|- A, still does not 
exactly equal I — A. The difference that remains is due 
to a deviation of eho{N) from flatness, which is attributed 
to the implicit non-linear dependence of an approximate 
Ef,^Hxc[n] on a. 


III. THE DERIVATIVE DISCONTINUITY A 

In this section, an analytical expression for the deriva¬ 
tive discontinuity A is derived by taking the limits N -A 
Nq and N —>■ Nq . First, let us introduce some notation. 
When N -A Nq , i.e. a —>■ 0+ (the ^ limit), the quantity 

is termed no, and the quantity pf'^ is termed ni. 
Note that because </^f(E) = ipf{r), these quantities are 
continuous when crossing No', for this reason they do not 
receive the index Also note that the energy Ef,-Hxc[n] 
and the density n were defined in Eqs. ([5]) and (fTUl) for 
the case when N G [No, Ag -|- I], i.e. to the right of the 
point Nq. In the region [Nq — I, Nq], which is of interest 
here as well, these quantities are defined similarly, sub¬ 
stituting /5 q“^ with and with p^^. Furthermore, 
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when N ^ Nq , i.e. a —>■ 1 (the ^ limit), the quantity 
is denoted n_i. 

We now focus on A = — v^nxci^’ which 

we choose to express as A = Aq + Ai. Here 

Ao = v^ - (14) 

and 

Ai = nf (f^ - uf (f). (15) 

Reaching the point N = Nq from the left, we obtain from 
Eq. IlSl) 

Vq = EHxclm] - EHxc[n-i] - / d^r\py{r)\'^vyno]{f). 

(16) 

Reaching Nq from the right, we obtain similarly 

Vq = Enxcini] - Enxcino] - j d^r Whoif)?'>^i['no]{r)- 

(17) 

Recalling that and using Eqs. (fT51) 

and CZl), we rewrite as 

Vq = EHxc[ni]-EHxc[no\-Ai- i d^r |(/9^(r)pnf'[no](r). 

(18) 

When reaching Nq from the left, nf [no](r) = VHxc[no\[r), 
where vhxc = dEuxc/dn is the usual Hartree-exchange- 
correlation potential defined for the pure ground state 
with Nq electrons. 

Finally, subtracting Eq. (fTBl) from Eq. (fT^ and using 
Eqs. (ICTi and (fTO . we can express the discontinuity A 
solely in terms of the ^ quantities, i.e. using only quan¬ 
tities that correspond to the system of interest, with ex¬ 
actly Nq electrons. A is given below, suppressing now 
the index ^ for clarity: 

A = E}jxc\^'\\ “f E}{xc\P‘—'i\ 

+ / d^r VHxc[nQ] (|</5?io(r)P - |<7’z«(f^)P) 

(19) 

Equation dMl is a key result of the current contribu¬ 
tion. It is achieved completely from first principles, 
meaning that no approximations were introduced dur¬ 
ing its derivation. Because the derivation is valid for an 
arbitrary xc functional (exact or approximate), we con¬ 
clude that all xc functionals possess a generally non-zero 
derivative discontinuity A, which is revealed by rigorous 
employment of the ensemble approach in DFT. This in¬ 
cludes, of course, also the simplest xc appr oxim ation - 
the LDA, used for the computations of Ref. [l23j | and in 
calculations presented below. 

Importantly, A as expressed in Eq. (fTOll is obtained 
using only quantities that belong to the original system 
of interest with Nq electrons. Therefore its calculation 
does not require any alteration of the number of elec¬ 
trons in the system. In particular, it is also applicable to 


periodic systems, namely, “jellium” background charge 
corrections do not have to be considered in Eq. [11] be¬ 
cause it is derived from a limit around the equilibrium 
point rather than from actual addition of charges. 

It is well-known (see, e.g., |l4l| l that although the 
band structure of the KS system cannot be rigorously 
related to properties of the interacting system, it never¬ 
theless can serve as an approximation to the charged ex¬ 
citation spectrum of the latter, apart from a rigid shift of 
the unoccupied bands with respect to the occupied ones. 
The corresponding shift is usually introduced empirically, 
or by relying on theories beyond DFT, e.g. many-body 
perturbation theory, and bears the name of the ’’scissors 
shift”. Here, A provides a similar effect, with the impor¬ 
tant difference that it is derived completely within DFT. 

The derivation above was performed within the OEP 
framework. However, it is important to note that the 
calculation of A does not require any actual use of the 
OEP formalism, but requires only simple operations of 
negligible numerical effort with quantities readily avail¬ 
able from a routine DFT calculation. Actual employment 
of the OEP scheme is needed only for calculation of the 
E{N) curve for fractional N. 


IV. THE LIMIT OF AN INFINITELY LARGE 
SYSTEM 

As discussed in the preceding section, Eq. (HD) is ap¬ 
plicable in principle to both finite and infinite systems. 
In this section, we investigate the properties of A for a 
periodic system by considering how it scales with sys¬ 
tem size as the latter approaches infinity. We obtain the 
analytical limiting expression and address its properties 
for both the exact exchange-correlation potential and the 
local density approximation (LDA). 

Consider a many-electron system, whose external po¬ 
tential, Vextiy, is periodic in space, i.e., Vext{r + R) = 
Vextiy, where R is a Bravais lattice vector. Neglecting 
surface effects, all properties of this system, including 
its derivative discontinuity A, can be obtained from the 
limit of a collection of M unit cells as M —>■ oo. Let 
us define some terms that are useful for taking such a 
limit. The total number of electrons the system is MNq, 
where Nq is the (finite) number of electrons per unit cell. 
The electron density is no(r) = The KS 

orbitals Pilf) are, as usual, normalized to 1 when inte¬ 
grating over the whole system, i.e., \pi{r)\'^d^r = 1, 

where the subscript all denotes integration over the entire 
system. Therefore, J^i^nQ{f)d^r = MNq as appropriate. 
Integration over one unit cell yields ^ nQ{r)d?r = Nq 
and ^ \pi{f)\‘^d^r = M~^ —>■ 0, where the subscript 
u.c. denotes integration over one unit cell. We there¬ 
fore define a renormalized KS orbital, Pi{r) = \/Mpi{r)^ 
such that ^ \pi{r)\'^d^r = 1. Like the electron density, 
\pi{ry remains finite for large M. 

To assess the limiting form of Eq. m, we first 
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address 7?^“^ [no + |<PiuP], which can be written as 
^HxcV >‘0 + ^\'^iu\^] using the renormalized orbitals. The 
Hartree-exchange-correlation energy can then be Taylor- 
expanded around no, with 1/M serving as the small pa¬ 
rameter, in the form: 


rpall 

^Hxc 


no + JlWiu? 


- <xc[no\ + ^ jj^r 


^^Hxc 
all '5n(f) 


\Viu{r)\^ 


1 




i3 f ^ ^Hxc 

dr 


\^ium^\^iu{r'r+0 


2 M 2 J^ii J^ii 5 n(f) 5 n{r') 

1 


( 20 ) 


A similar expression can be easily written for [no — 
|</5/toP]- Denoting the Hartree-exchange-correlation ker¬ 
nel by fHxc[n]ir,f') := S'^EHxc/Sn{r)dn{f'), recognizing 
that 5Ehxc/^ nif) = VHxc[n]{f), and using the renormal¬ 
ized orbitals (pi{r) in Eq. dni), we obtain: 


dll 'J dll 

+ |<p.o(r)| 2 |^,o(r')P]+ 0 (^^) ( 21 ) 


The Hartree-exchange-correlation kernel fHxc[n\{f,r') 
can be written as a sum of the Hartree and xc compo¬ 
nents: fHxc[n]{f,f') = fH[n]{r,f') + fa;c[n]{f,f"), where 
fH[no]{r,f') = 5 '^Eh/ 5n{r)5n{r') = l/|r — r'|. Then, 
the Hartree-related term of the derivative discontinuity 
can be expressed as 

d^r\ipj{r)\'^VHjir), ( 22 ) 

where j stands for ho or lu and VHj(f) = 
fall ~ t'\~^ . In the limit of large M (and 

neglecting the diverging term because the “jellium” back¬ 
ground is irrelevant, as explained above), both \(pj{r)\'^ 
and VHj{r) are periodic and remain finite as M —^ oo. 
Therefore the integration can be performed over a unit 
cell, yielding: 


= d^^\ipj{r)\^VHjiT^. (23) 

Therefore, the Hartree-related terms decay as M~^ and 
vanish for the periodic solid. 

The scaling of the xc contribution, is obviously 

much more interesting and it is here that the particu¬ 
lar choice of the xc functional is crucial. For the ex¬ 
act xc functional. Axe is generally expected to be non¬ 
vanishing, because fxc['no]ir,r') is know n to e xhib it di¬ 
vergence (see, e.g., the discussion in Ref. , and 

references therein). The nature of the singularity in the 
exact xc functional, therefore, must be such that for a 
periodic solid Axe obtained from Eq. (I^Tl) is the exact 



Ah 

A 


A 


XC 


FIG. 3: The derivative discontinuity A, as well as its Hartree- 
and xc-components {Ah and A^c, respectively), as a function 
of 1/m, where m is the number of primitive unit cells in the 
supercell, for GaAs 


one. Namely, the scaling for Axe with M as M —>■ oo 
should be ^ M°. In parallel, the xc potential Vxc must 
scale as ^ M°, and the xc energy Exc as ^ M^. 

Unfortunately, this is not the case for simple function¬ 
als such as the LDA. In the LDA, the xc kernel can be 
expressed as fxeif.r') = gxcir)S{f - f'), where gxdr) 
is a function of the density (and therefore periodic in a 
periodic system). As a result, the xc-related terms in 
Eq. (HI]) simplify to 237 Ju.c. i.e. they 
too decay as M~^. Therefore, in the LDA approxima¬ 
tion, for infinite systems A ^ M~^ —?► 0. 

As a practical illustration of the above analysis, we 
used Eq. m to evaluate Ah and Axe in practice, focus¬ 
ing on GaAs as a prototypical example. Briefly, all elec¬ 
tronic structure calculations were per formed using the 


real-space PARSEC package Il44l-ll4 
periodic boundary conditions 


I4g 


_while employing 
We used the 


Perdew-Zunger parameteriz ation 29l \ of LDA with norm- 
conserving n orm-c onserving |l50l | Troullier-Martins pseu¬ 
dopotentials |l57j| . 


To investigate the dependence of Ah, Axe, and A 
on the system size, we calculated these three quantities 
for increasingly large GaAs supercells of Ixlxl, 1 x 1 x 2 , 
1x1x3, 1x1x4, 1x2x2, and 2x2x2 primitive unit cells. Eq. 
m was then used under the assumption that ipho and 
ifiu can be taken as the highest occupied and lo west un- 
occupied orbitals, respectively, of the supercell. [l58j[ In 
other words, the supercell is treated as a finite but topo¬ 
logically periodic system and therefore approaches the 
bulk limit as the number of primitive unit cells, m, ap¬ 
proaches infinity. 

The results are given in detail in Fig. [3| Clearly Ah, 
Axe, and their sum. A, are indeed all linear with 1/m and 
vanish in the limit of a large enough supercell. As the 
LDA-KS gap remains a constant ^ 0.6 eV for all m, in 
the bulk limit the corrected LDA gap simply approaches 
the uncorrected one. Similar trends were obtained for 
several other prototypical semiconductors (e.g.. Si, Ge, 
InP) and are not shown for brevity. 

What is the physical origin for the apparent failure 
of the ensemble-correction for LDA (and indeed for any 

































semi-local functional) in the limit of a periodic solid? To 
understand it, consider again the results of Sec. [Ill in par¬ 
ticular Fig. [21 As shown there, the ensemble correction 
strongly reduces the curvature of the total energy versus 
particle number curve. This greatly assists in bringing 
the fundamental gap deduced from eigenvalue differences 
closer to the one deduced from total energy differences. 
That this would also help improve agreement with experi¬ 
ment hinges on the assumption that the fundamental gap 
deduced from total energy differences is close to the ex¬ 
perimental one. As discussed in Sec. im for small and 
mid-size objects extensive numerical experience shows 
that this is often the case (see, e.g., But for 

the infinite limit, it is in fact known that with the LDA, 
whose xc kernel is not singular, the fundamental gap de¬ 
duced from total energy differences corresponds p oorly 
to experiment and simply approaches the KS gap |15l| 
(For a similar reason, gaps deduced from time-dependent 
LDA also reduc e to the LDA ones in the solid state limit 
- see |l42l . Il52j , and references therein). 

Therefore, the correction corresponding to LDA should 
indeed vanish. From that perspective one could ar¬ 
gue that in the solid-state limit the ensemble correction 
scheme “fails successfully” for LDA, as it yields precisely 
what it was built to deliver - consistency between to¬ 
tal energy differences and eigenvalue differences (both of 
which are, alas, equally wrong). Complementarily, sev¬ 
eral studies have shown that in the bulk limit the LDA 
total energy versus particle number curve is piecewise- 
linear even w ithout ens emble corrections, albeit with the 
wrong slope [m [m. Also from this perspective, A 
must vanish, as there is no curvature to reduce. 

From yet a different perspective, mathematically the 
difficulty arises because the ho and lu orbitals are ex¬ 
tremely delocalized, whereas the LDA xc kernel is ex¬ 
tremely localized. This advocates the importance of 
ultra-non-local kernels. But in lieu of developing new 
functionals, another possibility is to localize the ho and 
lu orbitals. One such localization procedure is the above- 
mentioned dielectric screening based one suggested by 
Chan and Ceder [s^ and others may be envisaged. In 
fact, one could argue that use of a small supercell as in 
Fig.|3]is, loosely speaking, a form of (uncontrolled) local¬ 
ization. Indeed if one were to take the results for the sin¬ 
gle unit cell literally, one would obtain A=0.78 eV which 
would suggest a satisfying (but deceptive) agreement be¬ 
tween the fundamental gap, + A = 1.39 eV and the 
experimental fundamental gap value, 1.51 eV [l54j . A 
similar behavior is obtained for other semiconductors as 
well. This suggests that controlled, physically justified 
localization procedures may prove to be key to system¬ 
atic gap predictions even within LDA. 


We have shown much of the deviation of approximate 
functionals from piecewise linearity is in fact due to the 
lack of an ensemble treatment. We have used this to 
show that all exchange-correlation functionals possess a 
derivative discontinuity, which arises naturally from the 
application of ensemble considerations within DFT, with¬ 
out any empiricism or any further approximations be¬ 
yond the choice of the xc functional. We then expressed 
this derivative discontinuity in closed form using only 
quantities obtained in the course of a standard DFT cal¬ 
culation of the neutral system. We showed that for small, 
finite systems, addition of this derivative discontinuity in¬ 
deed results in a greatly improved prediction for the fun¬ 
damental gap, even when based on the most simple ap¬ 
proximate exchange-correlation density functional - the 
local density approximation (LDA). We then discussed 
the limit of an infinitely large system, so as to approach 
the solid-state limit. We found that the same scheme 
is exact in principle, but results in a vanishing deriva¬ 
tive discontinuity correction when applied to semi-local 
functionals. This failure was shown to be directly related 
to the failure of semi-local functionals in predicting fun¬ 
damental gaps from total energy differences in extended 
systems. Last, we discussed possible future remedies, es¬ 
pecially usage of localization schemes. 
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V. CONCLUSIONS 


In this article, we have revisited the issue of the deriva¬ 
tive discontinuity from an ensemble-DFT perspective. 
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